URL-Enhanced Adaptive Page-Refresh Models
Abstract
We study the refresh model required to keep an up-to-date copy of a web page. This has applications for more efficient and frequent crawling in the case of search engines, and for higher hit rates in proxy servers with pre-fetching. Two main models have been proposed, namely the uniform refresh model (Cho and Garcia-Molina, 2000) and the adaptive page-refresh model (Edwards et al., 2001), with some debate as to the relative value of each. In this work we show that adaptive page-refresh models can be further improved by careful selection of the initial page-refresh rates of newly added links. This is supported by our page-evolution studies, which show that page-change rates (and consequently page-refresh rates) depend on top-level domain, category, and page depth.
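The idea in the abstract, seeding a newly added link with a refresh rate derived from its URL and then adapting it, can be illustrated with a minimal Python sketch. It is not the authors' implementation: the tables `TLD_PRIOR_HOURS` and `CATEGORY_FACTOR`, the constant `DEPTH_FACTOR`, every numeric value, and the multiplicative update rule in `RefreshState.update` are hypothetical placeholders chosen for illustration. In particular, the assumption that deeper pages change less often is made only to keep the example concrete.

```python
from dataclasses import dataclass
from urllib.parse import urlparse

# Hypothetical prior refresh intervals (hours) per top-level domain.
# Illustrative values only; they are not results from the paper.
TLD_PRIOR_HOURS = {"com": 24.0, "org": 72.0, "edu": 168.0}
DEFAULT_PRIOR_HOURS = 96.0

# Hypothetical per-category multipliers (smaller means refresh sooner).
CATEGORY_FACTOR = {"news": 0.25, "reference": 2.0}

# Assumption for this sketch: each extra path level lengthens the interval.
DEPTH_FACTOR = 1.5


def top_level_domain(url):
    """Return the last label of the host name, e.g. 'com'."""
    host = urlparse(url).hostname or ""
    return host.rsplit(".", 1)[-1].lower()


def page_depth(url):
    """Count the non-empty path segments of the URL."""
    return len([seg for seg in urlparse(url).path.split("/") if seg])


def initial_interval_hours(url, category=None):
    """Pick a starting refresh interval from TLD, category, and depth."""
    base = TLD_PRIOR_HOURS.get(top_level_domain(url), DEFAULT_PRIOR_HOURS)
    factor = CATEGORY_FACTOR.get(category, 1.0)
    return base * factor * DEPTH_FACTOR ** page_depth(url)


@dataclass
class RefreshState:
    """Refresh interval for one page, adapted after each visit."""
    interval_hours: float

    def update(self, changed, shrink=0.5, grow=1.5, lo=1.0, hi=720.0):
        # Generic adaptive heuristic: visit sooner if the page changed,
        # back off if it did not, and clamp the result to [lo, hi].
        factor = shrink if changed else grow
        self.interval_hours = min(hi, max(lo, self.interval_hours * factor))
        return self.interval_hours


if __name__ == "__main__":
    state = RefreshState(initial_interval_hours("http://example.edu/a/b"))
    print(state.interval_hours)
    print(state.update(changed=True))
```

A crawler would call `initial_interval_hours` once when a link is first discovered and `RefreshState.update` after every revisit. The better the URL-based starting value, the fewer visits the adaptive step needs before it settles on a suitable interval.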
Similar resources
Technology Brief: The Shadow Uniform Resource Locator: Standardizing Citations of Electronically Published Materials
Citation of scientific materials published on the Internet is often cumbersome because of unwieldy uniform resource locators (URLs). The authors describe a format for URLs that simplifies citation of scholarly materials. Its use depends on a simple HTML device, the "refresh page." Uniform citation would follow this format: [Author I. Title of article. http:// domain/year/month-day(e#).html]. Th...
Prioritize the ordering of URL queue in Focused crawler
The enormous growth of the World Wide Web in recent years has made it necessary to perform resource discovery efficiently. For a crawler, it is not a simple task to download domain-specific web pages. This unfocused approach often shows undesired results. Therefore, several new ideas have been proposed; among them, a key technique is focused crawling, which is able to crawl particular topical...
Class-based Cache Management for Dynamic Web Content
Caching dynamic pages at a server site is beneficial in reducing server resource demands and it also helps dynamic page caching at proxy sites. Previous work has used fine-grain dependence graphs among individual dynamic pages and underlying data sets to enforce result consistency. This paper proposes a complementary solution for applications that require coarse-grain cache management. The key ...
Adaptive features of a hypermedia system using bookmark per Web page
In this paper, we present a technique called bookmark per Web page by which users can build a new link structure in a hypermedia. It allows users to create their own links for Web pages. Our aim is to provide an adaptive hypermedia with additional adaptive features by considering properties which the author of the hyperspace did not provide when the hyperspace was designed. In addition, a gener...
URL Forwarding and Compression in Adaptive Web Caching
Web caching is generally acknowledged as an important service for alleviating focused overloads when certain web servers’ contents suddenly become popular. Cooperative caching systems are more effective than independent caches due to the larger collective backing store that cooperation creates. One such system currently being developed at UCLA, Adaptive Web Caching (AWC), uses an application-le...
Publication date: 2005